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Abstract  -  This  paper  deals  with  the  multiperspective 
recognition  technique  applied  to  the  computer-aided  decisions 
in  medicine.  For  three  different  concepts  of  multiperspective 
classification,  i.e.  direct,  decomposed  independent  and 
decomposed  dependent  approach,  several  decision  algorithms 
are  presented.  They  are:  probabilistic  (empirical  Bayes) 
algorithm,  nearest  neighbour  algorithm,  fuzzy  method  and 
artificial  neural  network  of  the  back  propagation  and  counter 
propagation  types.  Proposed  methods  and  algorithms  have 
been  applied  to  the  computer-aided  diagnosis  of  chronic  renal 
failure  and  decisions  in  non-Hodgkin  lymphoma.  Results  of 
experimental  investigations  on  the  real  data  and  outcomes  of 
the  comparative  analysis  of  discussed  algorithms  are  presented. 
Keywords  -  Multiperspective  recognition,  probabilistic 
methods,  neural  network,  fuzzy  approach,  chronic  renal 
failure,  non-Hodgkin  lymphoma. 

I.  Introduction 

One  of  the  most  often  used  instruments  for  the 
computer-aided  medical  diagnosis  are  pattern  recognition 
methods.  This  is  because  the  problem  of  classification  of  a 
given  pattern  is  identical  to  the  task  of  medical  diagnosis 
and  therefore  the  methods  and  algorithms  found  in  this 
theory  are  directly  employed. 

The  classical  pattern  recognition  problem  is  concerned 
with  the  assignment  of  a  given  pattern  to  one  and  only  one 
class  from  a  given  set  of  classes.  Multiperspective 
classification  (MC)  problem,  which  idea  has  been  presented 
in  [1]  refers  to  a  situation  in  which  an  object  undergoes 
several  classification  tasks.  Each  task  denotes  recognition 
from  a  different  point  of  view  and  with  respect  to  different 
set  of  classes.  For  example,  such  a  situation  is  typical  for 
compound  medical  decision  problems  where  the  first 
classification  denotes  the  answer  to  the  question  about  the 
kind  of  disease,  the  next  task  states  recognition  of  the 
stadium  of  disease,  the  third  one  determines  the  kind  of 
therapy,  etc. 

This  work  presents  several  different  approaches  to 
algorithmization  of  the  multiperspective  diagnosis  task  and 
the  implied  decision  algorithms.  The  following  approaches 
will  be  explored  in  turn:  the  sample-based  probabilistic 
approach,  the  nearest  neighbour  algorithm,  the  fuzzy  system 
and  artificial  neural  networks.  It  should  be  noted  that  for 
each  of  the  four  cases  a  set  of  diagnostic  algorithms  will  be 
proposed.  The  way  and  scope  of  effective  usage  of  the 
available  data  will  be  different  for  each  set,  as  will  the 
formal  model  for  the  corresponding  decision  task  be.  We 
will  also  perform  an  experimental  comparative  analysis  of 
the  proposed  algorithms  for  real  data  that  concern  the 
diagnosis  of  chronic  renal  failure  and  decision  problems  in 
non-Hodgkin  lymphoma.  The  perforated  experimental 


research  will  enable  us  to  evaluate  the  presented  conceptions 
along  with  usefulness  of  the  proposed  algorithms,  as  well  as 
recommend  them  for  possible  practical  applications. 

H.  MULTIPERSPECnVE  RECOGNITION 

Let  us  consider  ^perspective  recognition  problem.  Let 
x/^e  X  /c  and  /^e  M  ^  denote  the  vector  of  features  and  the 
class  number  for  the  k- th  recognition  task,  respectively. 

In  the  case  of  MC  we  can  in  different  manner  define  the 
action  of  recognizer  which  leads  to  the  following 
recognition  concepts  [2]: 

Direct  approach:  a  decision  rule  T*  of  MC  maps  the  space  of 
entire  feature  vector  X  to  the  product  M  t  x  M  2  x  ...x  M  n  ■ 
It  means  that  now  MC  is  a  single  activity  in  which  a  given 
pattern  is  classified  into  classes  simultaneously  for  all 
recognition  tasks. 

Decomposed  independent  approach:  an  object  is  classified 
in  a  succesive  manner  for  particular  recognition  tasks  and 
the  decision  algorithm  yV/c  for  k- th  classification  uses 
features  x k  only,  viz:  T^:  X  £  — »  M  £  This  concept  denotes 
decomposition  of  MC  problem  on  ^independent  decisions 
which  do  not  take  into  account  the  fact  that  they  state  steps 
of  the  whole  compound  decision  process. 

Decomposed  dependent  approach:  an  object  is  classified  in  a 
successive  manner  for  particular  recognition  tasks  and  now 
the  decision  algorithm  for  the  k- th  classification  uses  not 

only  features  xfc  but  additionally  the  results  of  previous 
classifications,  viz.,  VF^:  X^.  xM1xM2x...xMA,  —>  M 
This  concept  similarly  denotes  the  decomposition  of  the 
whole  MC  problem  but  now  the  successive  classification 
tasks  are  mutually  dependent  and  fully  describe  real 
situations  occurring  in  the  computer-aided  medical 
diagnosis  tasks. 

In  next  section  we  present  different  approaches  to  the 
construction  of  decision  algorithms  of  multiperspective 
recognition  (diagnosis). 

HI.  Methods 

A.  The  Probabilistic  Algorithms 

Now  we  assume,  that  the  vector  of  features  and  the 
class  number/^  (k=  1 ,2,...,/V)  for  the  /c-th  recognition  task  of 
the  pattern  being  recognized  are  observed  values  of  random 
variables  and j^,  respectively.  Probability  distribution  of 

xk  Jk’  k=\ given  by  a  priori  probabilities  pjk  and 
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class -condintional  density  functions  ( CDF)  fjk  (xk )  .  Using 

the  Bayes  decision  scheme  [3],  we  can  receive  the  following 
recognition  algorithms  for  considered  concepts  of 
multiperspective  diagnosis: 

Bayes  algorithm  for  direct  approach  (Bayes- A) 

\| =  (D 

N  N 

if  £/%/*)=  ,  max  Y, P('k/x )> 
k= 1  <1.'2 WJfc=l 


3.  Assign  the  recognized  pattern  into  the  class  j'f  . 
Decomposed  dependent  multiperspective  NN  algorithm 
(NN-D)  (for  the  k-th  task) 

Notation:  i  j  ,i2>—^k- 1  '  results  °f  recognition  for  the  first  A- 1 
tasks. 

1.  Measure  features  of  the  pattern  to  be  recognized, 

2.  Find  the  nearest  neighbour  (in  the  space  X  j()  to  x/(  from 

among  learning  patterns  for  which  jn  consists  at  the  first  k- 1 
positions  the  sequence  0et  say  xk ), 


where  a  posteriori  probabilities  p(ik  lx) can  be  calculated 
from  the  Bayes  formula. 

Bayes  algorithm  for  decomposed  independent  approach 
(Bayes- 1)  (for  k-th  task) 

(xk )  =  ik  if  p(ik  lx)  =  max  p(ik  / x)  (2) 

'k 

Bayes  algorithm  for  decomposed  dependent  approach 
(Bayes-D)  (for  k-th  task) 

\\t(BD\xk,il,i2,-Jk-i)=ik  if  (3) 


Pdk =  mP P(k  /xk,ii,i2’"’ik-l)  ■ 

‘k 

In  the  real  world  there  is  often  a  lack  of  exact  knowledge 
of  a  priori  probabilities  and  CDFs.  For  instance,  there  are 
situations  in  which  only  a  learning  sequence 

S  L=(xl  (4) 

Xk  =  (*f  >  x2  ,—xN  ),  jk  =  (j\  ,  J2  >— .  7&  )  » 

that  is  a  set  of  case  records  with  the  firm  diagnosis  is  known. 
Now  conceptually  simple  method  is  to  estimate  unknown  a 
priori  probabilities  and  CDFs  and  then  to  use  these 
estimators  to  calculate  a  posteriori  probabilities. 

B.  Nearest  Neighbour  Algorithms 

Nearest  Neighbour  (NN)  algorithm  is  very  well  known 
classification  technique  [3]  using  learning  set  (4).  Its 
modified  versions  for  three  different  concpts  of  MC,  can  be 
formulated  as  follows. 

Direct  multiperspective  NN  algorithm  (NN-A) 

1.  Measure  features  x  of  the  pattern  to  be  recognized, 

2.  Find  the  nearest  neighbour  (in  the  space  X )  to  x  from 

among  learning  patterns  (let  say  xm) , 

3.  Assign  the  recognized  pattern  to  the  sequence  of  classes 

jm  • 

Decomposed  independent  multiperspective  NN  algorithm 
(NN-I)  (for  the  k-th  task) 

1.  Measure  features  of  the  pattern  to  be  recognized, 

2.  Find  the  nearest  neighbour  (in  the  space  X  f)  to  x/(  from 

among  learning  patterns  (let  say  x'k  ), 


3.  Assign  the  recognized  pattern  into  the  class  jk  . 

C.  The  Fuzzy  Methods 

Now  we  take  to  decision  algorithms  for  the  MC  using 
the  inference  engine  that  makes  inferences  on  a  fuzzy  rule 
system.  For  all  the  algorithms  presented  below  there  is  a 
common  rule  form  for  rules  that  associate  an  observation 

vector  a  =  (a(l) ,ai2) )  with  a  diagnosis.  Further,  we 
assume  the  following  general  form  of  the  k-th  rule  in  the 
system  (k  =  1,2,...,  K): 

IF  amisAu  AND. ..AND  a{n)isAn  k  THEN  b  is  Bk  ,  (5) 

where  A;  k  are  fuzzy  sets  (whose  membership  functions  are 
designated  bypA;  k  )  that  correspond  to  the  nature  of 
particular  observations  (for  simplicity  we  assume  the  sets  to 
be  triangular  fuzzy  numbers)  whereas  Bk  is  a  discrete  fuzzy 
set  defined  on  the  diagnosis  set,  with  the  p  /;/  membership 
function. 

The  particular  decision  algorithms  to  be  used  in  MC 
have  in  common  both  the  inference  engine  and  the 
procedure  for  rule  system  (5)  derivation  from  the  learning 
set  (4).  The  only  difference  between  the  algorithms  is  the 
form  of  observation  vector  a  and  diagnosis  b,  what  follows, 
the  number  and  nature  of  premises  and  conclusion  in  the 
rules  (5).  Let  us  first  briefly  present  both  of  the  common 
procedures. 

The  Inference  Engine  Procedure — composed  of  the  three 
following  steps: 

1.  For  the  k-th  rule  we  detemiine  the  degree  of  fulfilment 
factor  for  a  given  observation  a: 

vk(a)=  min  | it;*,  (a,).  (6) 

i=l,2,..„ L 

2.  Combination  of  fuzzy  rules — we  detemiine  the  resultant 
consequence  in  the  form  of  a  fuzzy  set  B  using  the  principle 
of  maximum  combination  of  responses: 

\iB(x,a)=  max  vk(a)pB  (x).  (7) 

*=1,2 . L  K 

3 .  Defuzzification  —  as  we  deal  with  discrete  conclusions, 
we  use  defuzzification  by  maximum,  namely: 

j(a)  =  max  pB(x,a) . 

X 


(8) 


A  Procedure  for  Deriving  the  Rule  Systems  from 
Training  Sets.  We  have  accepted  here  the  modified  b-cut 
algorithm  [4]that  is  composed  of  the  following  steps: 

1.  Find  extremes  for  each  variable  in  the  training  set 
(/  =  l,2,...,n  ): 


AD  = 


(D  AD  _ 


=  max  a  -  ,a  .  =  mm  a  ■ 


M) 


(=1,2,...,  Z. 


(=1,2 . L 


2.  Find  the  triangular  fuzzy  number  (a^}  ,aP,a!^+)  for 
the  Z-th  variable  and  the  k- th  rule  corresponding  to  the 
diagnosis  jfce  M  : 

a(.l]  =  min  {a  ^  such  thatj'=fc}, 
k  (=1,2 . L  ' 


( D+ 

a,  =  max  \a 
i=l,2 . L 


{a^'such  thatj'!  =  k] , 


AD 


— y  {a^such  that  j'  =  £}> 


(/) 


where  Nk  is  a  number  of  such  elements  that  j'  =  k  . 

3 .  Calculate  for  each  variable  the  relative  diameter  of  the 
support: 


8/  =  min 

feM 


„(/)+ 


-  a 


(/)- 

k 


,(o 


-  a 


(/) 

min 


Variables  with  8;  >  A  are  not  considered  to  belong  to  the 
rule  system.  The  other  ones  will  constitute  the  selected  set  of 
n  explanatory  premises. 

The  fuzzy  algorithm  of  MC  for  direct  approach  (Fuzzy  A) 


Now  we  have  one  system  of  fuzzy  rules,  where  a  =  x  and 
*=0'l.J2.-..,i'jv)- 

The  fuzzy  algorithm  of  MC  for  decomposed  independent 
approach  (Fuzzy  I) 

In  this  approach  there  are  N  sets  of  fuzzy  rules  for  each  task 
of  MC.  In  rules  for  Zc-th  task  a  =  xk  and  b  =  jk. 

The  fuzzy  algorithm  of  MC  for  decomposed  dependent 
approach  (Fuzzy  D) 

As  previously,  but  now  a  =  ( xk ,  /j ,...,  jk_\ )  and  b  =  jk. 


D.  The  Neural  Network  Approach 

Similarly  to  the  fuzzy  approach,  applying  artificial 
neural  networks  as  an  implementation  of  the  decision 
algorithm  for  MC  is  concerned  exclusively  with  the  relevant 
selection  of  input  data. 

Two  kinds  of  neural  networks  have  been  accepted  for  the 
needs  of  a  comparative  analysis,  namely  the  Back 
Propagation  (BP)  and  Counter  Propagation  (CP)  networks 
[5]. 

The  Type  BP  Network 

The  network  consists  of  the  input,  hidden,  and  output 
neuron  layers.  The  input  layer  plays  the  role  of  a  data  buffer 
so  that  the  data  are  normalized  to  belong  to  the  [0,  1]  range. 
There  have  been  various  numbers  of  input  layer  neurons, 
depending  on  actual  quantities  of  data.  Both  the  hidden  and 
output  layer  neurons  have  the  sigmoid  transition  function; 
the  corresponding  layers  are  trained  by  means  of  the  error 


back  propagation  method.  Neurons  of  the  successive  layers 
are  connected  on  the  each-to-each  basis. 

The  Type  CP  Network 

The  network  has  three  layers  of  neurons.  Just  as  it  was  the 
case  in  the  BP  network,  the  input  layer  normalizes  input 
vectors  to  remove  analysis  errors  that  are  typical  for  the  CP 
networks.  The  second  layer,  trained  according  to  the 
Kohonen  algorithm,  possesses  an  empirically  estatblished 
number  of  neurons  that  depends  on  input  data  quantities  and 
ranges  within  the  limits  from  50  to  70  neurons.  The  output 
layer  has  been  trained  using  Grossberg's  outstar  method. 
Neuron  layers  are  connected  on  the  each-to-each  basis. 

The  input  data  sets  are  just  the  same  as  those  for  the 
fuzzy- approach  algorithms.  Thus  the  BP-A  and  CP-A 
designations  correspond  to  type  BP  and  CP  networks, 
respectively,  for  direct  approach  of  MC,  ie.  with  input 
vector x  and  output  (yj ,  j2 jN)-  Further,  BP-I,CP-I,BP- 
D  and  CP-D  algorithm  designations  denote  the  relevant 
networks  used  for  decomposed  independent  and  decoposed 
dependent  approaches,  respectively. 

All  the  decision  algorithms  that  are  depicted  in  this 
chapter  have  been  experimentally  tested  as  far  as  the 
decision  quality  is  concerned.  Measure  for  the  decision 
quality  is  the  frequency  of  correct  diagnoses  for  real  data 
that  are  concerned  with  multiperspective  recognition  of 
chronic  renal  failure  and  decisions  in  non-Hodgkin 
lymphoma..  The  purpose  of  our  research  and  associated  tests 
was  not  only  the  comparative  analysis  of  the  presented 
algorithms  but  also  answering  the  question  whether 
decomposed  dependent  approach  would  yield  a  better 
decision  quality  as  compared  to  decomposed  independent 
method.  The  next  chapter  describes  the  performed  tests  and 
their  outcome. 


IV.  Practical  Examples 

A.  Diagnosis  of  chronic  renal  failure  in  children 

Chronic  renal  failure  (CRF)  is  a  syndrome  of  of  clinical 
symptoms  caused  by  the  adverse  action  of  a  factor  or  factors 
of  the  urinary  tracts.  It  is  characterized  by  the  failure  to 
eliminate  the  final  products  of  nitrogen  metabolism,  acid- 
base  imbalance,  failure  to  maintain  the  water-electrolyte 
balance  and  a  damage  to  the  endocrine  function  of  the 
kidneys. 

The  complete  diagnosis  of  CRF  contains  the  two 
following  task: 

■  recognition  of  etiologic  type  of  CRF 
(glomerulonephritis,  pyelonephritis,  renal  calculosis, 
renal  defect,  metabolic  disorders,  CRF  as  a  result  of 
systemic  disease), 

■  recognition  of  CRF  phase  (decadent,  compensated, 
and  decomensated  CRF). 

The  vector  of  features  contains  the  values  of  53  clinical  data 
presented  in  Table  1. 

In  the  Department  of  Pediatric  Nephrology  the  set  of  380 
case  records  of  children  suffering  from  chronic  renal  failure 
were  collected,  which  constitute  the  learning  set  (4).  Results 


of  experimental  investigations  of  MC  algorithms  for 
different  approaches  are  presented  in  Table  3. 

Table  1 

CLINICAL  FEATURES  CONSIDERED 
GENERAL:  Age,  Sex 

PAST  AND/OR  CONCURRENT  DISEASES:  Organic  heart  disease. 
Nephrosis,  Surgery  (abdominal),  Abdominal  tumor.  Streptococcal 
pharyngitis.  Scarlatina,  Urinary  tracts  lithiasis 

ANAMNESIS:  Fever,  Hereditary  transmission,  Diarrhoea,  Vomiting, 
Convulsions,  Growth  of  signs 

PHYSICAL  EXAMINATIONS:  Condition  of  child.  Skin,  Blood 
pressure  (syst.  and  diast.).  Extravasation,  Oedema,  Pulse,  Body 
temperature.  Urine  in  bladder 
LABORATORY  EXAMINATIONS:  Sedimentation  rate 
GASOMETRIC  EXAMINATIONS:  p  02,  p  C02,  pH,  BE 

MORPHOLOGY:  Leucocytes,  Reticulocytes,  Trombocytes, 

Erytrocytes,  Hemoglobin 

SERUM:  Urine  level,  Total  protein  level 

SERUM  IONOGRAM:  Na+,  K+ 

URINE:  Micturition,  Intermittent  diuresis,  Haematuria,  Erythrocyturia, 
Erythrocytes,  Protein,  Cylinders 

OTHERS:  Pain,  Urine  culture,  Toxins,  Arrhythmia,  Inflammation  of 
heart  muscle.  Infection  of  respiratory  tracts,  Dyspnoea,  Pulmonary 
oedema.  Dehydration 

B.  Decision  problems  in  non-Hodgkin  lymphoma  {NHL) 

The  non-Hodgkin  lymphoma  is  a  common  dilemma  in 
hematologic  practice.  It  provides  a  physician  with  many 
difficult  decision  problems  which  can  be  treated  as  a 
multiperspective  recognition  presented  in  Table  2. 

Table 2 


MULTIPERSPECTIVE  RECOGNITION  IN  NON-HODGKIN  LYMPHOMA 


Task 

Decision 

Features 

1 

Lymphoma  type 

(small  degree  of  malignancy,  moderate 
degree  of  malignancy,  immunoblastic 
lymphoma,  lymphoblastic  lymphoma, 
Burkitt-type  lymphoma) 

cytologic  examinatons 
of  cells,  lymphoma 
structure  (e.g.  diffuse, 
nodulous,  etc.) 

2 

Stage  of  lymphoma  development 
(first,  second,  third  and  fourth  stage  of 
lymphoma  development) 

Nodes,  tissues  and 
organs  affected 

3 

Lymphoma  form 

(form  A  -  no  additional  features , 

form  B  -  with  additional  features) 

Additonal  features 
(fever,  insomnia, 
diaphoresis,  etc.) 

4 

Scheme  of  treatment 

CHOP,  BCVP,  COMBA,  MEVA,  COP, 
CHOP-BLEO 

Decisions  i^,  i2,  and  ig 

5 

Modification  of  scheme 
(recommendations  for  grading  of  acute 
and  subacute  toxicity) 

Side  effects  (hair, 
infection,  cardiac 
rhythm,  pain,  etc.) 

In  order  to  study  the  performance  of  the  proposed 
recognition  concepts  and  evaluate  their  usefulness  to  the 
computer-aided  decisions  in  non-Hodgkin  lymphoma  some 
computer  experiments  were  made  using  the  set  of  225  case 
records  collected  in  the  Wroclaw  Medical  Academy.  The 
outcome  is  shown  in  Table  3. 

V.  Conclusion 

Results  of  experiments  imply  the  following  conclusions: 

1.  Out  of  all  the  above-mentioned  approaches  to 
mu Iti perspective  medical  diagnosis,  the  best  outcome  is  that 
achieved  as  a  result  of  using  the  type  Back  Propagation 


neural  network.  The  probabilistic  algorithm  yields  a  little 
worse  results.  Still  worse  are  the  results  obtained  from  the 
Counter  Propagation  neural  network  whilest  the  fuzzy  logic 
algorithms  turn  out  to  be  the  undoubtedly  worst  ones. 

2.  There  occurs  a  common  effect  within  each  algorithm 
group:  algorithms  for  decomposed  independent  approach  are 
always  worse  than  rules  for  direct  and  decomposed 
dependent  concept. 

The  comparative  analysis  presented  above  for  the 
multiperspective  diagnosis  algorithm  is  also  of  the 
experimental  nature.  We  have  carried  out  a  series  of 
experiments  on  the  basis  of  a  specific  exemplar  that 
concerns  chronic  renal  failure  and  non- Hodgkin  lymphoma 
diagnosing  using  rich  enough  set  of  real-life  data.  The 
objective  of  our  experiments  was  to  measure  quality  of  the 
tested  algorithms  that  was  defined  by  the  frequency  of 
correct  decisions.  The  algorithm- ranking  outcome  cannot  be 
treated  as  one  having  the  ultimate  character  as  a  that  of  a  law 
in  force,  but  it  has  been  achieved  for  specific  data  within  a 
specific  diagnostic  task.  However,  although  the  outcome 
may  be  different  for  other  tasks,  the  presented  research  may 
nevertheless  suggest  some  perspectives  for  practical 
applications. 

Table  3 

Results  of  accuracy  of  computer-aided  diagnosis  of  chronic 

RENAL  FAILURE  (CRF)  AND  DECISIONS  IN  NON-HODGKIN  LYMPHOMA 
(NHL)  FOR  DIFFERENT  ALGORITHMS  AND  CONCEPTS  OF  MULTIPERSPECTIVE 
RECOGNITION 


Approach 

Algorithm 

CRF 

Accuracy  [%] 

NHL 

Accuracy  [%] 

Direct  approach 

Bayes  -  A 

90,3 

83,9 

NN-A 

80,8 

78,3 

Fuzzy  -  A 

71,3 

74,2 

BP- A 

92,3 

85,3 

CP -A 

85,7 

81,2 

Decomposed 

independent 

approach 

Bayes  - 1 

71,4 

68,4 

NN-I 

65,3 

59,3 

Fuzzy  - 1 

55,3 

50,8 

BP  - 1 

73,8 

69,3 

CP- 1 

68,4 

66,5 

Decomposed 
dependent  approach 

Bayes  -  D 

86,4 

81,6 

NN-D 

75,6 

76,3 

Fuzzy  -  D 

69,4 

70,5 

BP-D 

89,6 

84,1 

CP-D 

82,6 

80,9 
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